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Please amend claims 1, 10, 19, and 30. Please add new claims 41-46. The claims arc as 
follows: 



1 . (Currently amended) A method for document analysis and retrieval, comprising the following 
steps performed in the order recited; 

transmitting, by a remote host in a first computing system to a web service host in a 
second computing system, a first portion of a document; and 

sequentially transmitting, by the remote host to the web service host, at least one 
additional portion of the document, wherein the first portion and the at least one additional 
portion collectively comprise the entire document, wherein the entire document is adapted to be 
reconstructed and subsequently processed via processing said entire document by die web service 
host, said processing comprising at least one of: 

extracting text from said entire document to configure said text in a text format, i r 

said entire document received by said web service host comprises said text in a non-text 

format; determine 

generating document keys associated with said text from analysis of said text in 
said text format, if said entire document received by said web service host comprises said 
text in said text format, or if said web service host has previously performed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of closet 
closes t categories to the document based on a comparison between the document keys 
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and category keys of the given categories, if said entire document received by said web 
service host comprises said document keys, or if said web service host has previously 
performed said generating such that said document keys arc available to said web service 
host. 

2. (Original) The method of claim 1, further comprising prior to the sending step identifying said 
web services host, said identifying comprising: 

executing a Universal Description, Discovery, and Integration (UDDT) search to identify 
one or more web services hosts who can receive said document in chunks and who can perform 
said at least one of said extracting, generating, and stemming; and 

selecting said web services host from said one or more web services hosts. 

3. (Original) The method of claim 1, wherein said transmitting and sequentially transmitting 
comprise respectively transmitting and sequentially transmitting the first portion and the at least 
one additional portion via Internet transmission to said web service host. 

4. (Original) The method of claim 1, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 

5. (Original) The method of claim 1, wherein said processing comprises said extracting, said 
generating, and said determining. 
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6. (Original) The method of claim 1, wherein said processing consists of two of said 0Xtra 3jj%teg% 
said generating, and said dctenrtining. 

7. (Original) The method of claim 1, wherein said processing comprises said extracting but not 
said generating and not said determining. 

8. (Original) The method of claim 1, wherein said processing comprises said generating but not 
said extracting and not said determining. 

9. (Original) The method of claim 1, wherein said processing comprises said determining but not 
said extracting and not said generating. 
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10. (Currently amended) A system for document analysis and retrieval, comprising a first 
computing system that includes a remote host, wherein the remote host is remote relative to a 
web service host in a second computing system, and wherein the remote host is adapted to: 
transmit a first portion of a document to the web service host; and 
sequentially Ironsmit at least one additional portion of the document to the web service 
host, wherein the first portion and the at least one additional portion collectively comprise the 
entire document, wherein the entire document is adapted to be reconstructed and subsequently 
processed via processing said entire document by the web service host, said processing 

comprising at least one of: 

extracting text from said entire document to configure said text in a text format, if 
said entire document received by said web service host comprises said text in a non-text 
formal; detennine 

generating document keys associated with said text from analysis of said text in 
said text formal, if said entire document received by said web service host comprises said 
text in said text format, or i f said web service host has previously per formed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of closet 
cl osest categories to the document based on a comparison between the document keys 
and category keys of the given categories, if said entire document received by said web 
service host comprises said document keys, or if said web service host has previously 
performed said generating such that said document keys are available to said web service 
host. 
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r ir(Original) The system of claim 10, wherein the remote host is adapted to identify said web 
services ho st by: 

executing a Universal Description, Discovery, and Integration (UDDI) search to identify 
one or more web services hosts who can receive said document in chunks and who can perform 
said at least one of said extracting, generating, and determining; and 

selecting said web services host from said one or more web services hosts. 

12. (Original) The system of claim 1 0, wherein to send transmit and to sequentially transmit 
comprises to respectively transmit and sequentially transmit the first portion and the al least one 
additional portion via Internet transmission to said web service host. 

13. (Original) The system of claim 10, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 

14. (Original) The system of claim 10, wherein said processing comprises said extracting, said 
generating, and said determining. 

15. (Original) The system ofclaim 1 0, wherein said processing consists of two of said extracting, 
said generating, and said determining. 

16. (Original) The system of claim 10, wherein said processing comprises said extracting but not 
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said generating and not said determining. 

17. (Original) The system of claim 10, wherein said processing comprises said generating but not 
said extracting and not said determining. 

18. (Original) The system of claim 10, wherein said processing comprises said determining but 
not said extracting and not said generating. 

19. (Currently amended) A method for document analysis and retrieval, comprising the following 
steps performed in the order recited: 

receiving, by a web service host in a second computing system from a remote host in a 
first computing system, a first portion of a document; 

sequentially receiving, by the web service host from the remote host, at least one 
additional portion of the document, wherein the first portion and the at least one additional 
portion collectively comprise the entire document; 

reconstructing the entire document from the first portion and the at least one additional 

portion; and 

processing the entire document by the web service host, wherein said processing 

comprises at least one of: 

extracting text from said entire document to configure said text in a text formal, if 
said entire document received by said web service host comprises said text in a non-text 
format; 
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generating document keys associated with said text from analysis of said text .in 
said text format, if said entire document received by said web service host comprises said 
text in said text format, or if said web service host has previously performed said 
extracting such that said text in said text formal is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of ctaset 
closest categories to the document, if said entire document received by said web service 
host comprises said document keys, or if said web service host has previously performed 
said generating such that said document keys arc available to said web service host. 

20. (Original) The method of claim 1 9, wherein the web services host is listed in a Universal 
Description, Discovery, and Integration (UDDI) registry as being able to receive said document 
in chunks and being able to perform said at least one of said extracting, generating, and 
dctennining. 

21. (Original) The method orclaim 19, wherein said receiving and sequentially receiving steps 
comprise receiving the first portion and the at least one additional portion via Internet 
transmission from said remote host. 

22. (Original) The method of claim 19, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 
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23. (Original) Hie method of claim 19 S wherein said processing comprises said extracting, said 
generating, and said determining. 

24. (Original) The method of claim 19, wherein said processing consists of two of said 
extruding, said generating, and said determining. 

25. (Original) The method of claim 1 9, wherein said processing comprises said extracting but not 
said generating and not said determining. 

26. (Original) The method of claim 19, wherein said processing comprises said generating but 
not said extracting and not said determining. 

27. (Original) The method of claim 19, wherein said processing comprises said delcrminingbut 
not said extracting and not said generating. 

28. (Original) The method ofclaim 19, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

29. (Original) The method ofclaim 19, wherein said processing comprises said determining, and 
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wherein the method further comprises: 

creating a search string, said search string comprising a logical function of a subset of 

said document keys; 

submitting said search string to a search engine; 

receiving links to related documents from said search engine, said links being based on 

said search string; and 

returning said links to said remote host. 

30. (Currently amended) A system for document analysis and retrieval, comprising a second 
computing system that includes a web service host, wherein the web service host is remote 
relative to a remote host in a first computing system, and wherein the web service host is adapted 
to: 

receive a first portion of a document Irom the remote host; 

sequentially receive at least one additional portion of the document from the remote host, 
wherein the first portion and the at least one additional portion collectively comprise the entire 
document; 

reconstruct the entire document from the first portion and the at least one additional 
portion; and 

implement processing the entire document, said processing comprising at least one of: 

extracting text from said entire document to configure said text in a text format, if 
said entire document received by said web service host comprises said text in a non-text 
format; 
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generating document keys associated with suid text from analysis of said text in 
said text format, if said entire document received by said web service host comprises said 
text in said text fonnat, or if said web service host has previously performed said 
extracting such that said text in said text format is available to said web service host; and 

determining, from given categories of a document taxonomy, a set of eloset 
closest categories to the document, if said entire document received by said web service 
host comprises said document keys, or if said web service host has previously performed 
said generating such that said document keys arc available to said web service host. 

3 1 . (Original) The system of claim 30, wherein the web services host is listed in a Universal 
Description, Discovery, and Integration (UDD1) registry as being able to receive said document 
in chunks and being able lo perform said at least one of said extracting, generating, and 
determining. 

32. (Original) The system of claim 30, wherein to receive and sequentially receive comprise to 
receive the first portion and the at least one additional portion via Internet transmission from said 
remote host. 

33. (Original) The system of claim 30, wherein said generating comprises: 

generating tokens of said text such that stop words do not appear in said tokens; and 
stemming said tokens to generate said document keys from said tokens. 
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34. (Original) The system of claim 30, wherein said processing comprises said extracting, said 
generating, and said determining. 

35. (Original) The system of claim 30, wherein said processing consists of two of said extracting, 
said generating, and said determining. 

36. (Original) The system of claim 30, wherein said processing comprises said extracting but not 
said generating and not said determining. 

37. (Original) The system of claim 30, wherein said processing comprises said generating but not 
said extracting and not jsaid determining. 

38. (Original) Hie system of claim 30, wherein said processing comprises said determining but 
not said extracting and not said generating. 

39. (Original) The system of claim 30, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

40. (Original) 'Hie system of claim 30, wherein said processing comprises said determining, and 
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wherein the method further cornfimcs:** 

creating a search siring, said search siring comprising a logical function of a subset of 
said document keys; 

submitting scud search siring to a search engine; 

receiving links to related documents from said search engine, said links being based on 
said search string; and 

returning said links to said remote host. 

41. (New) The system of claim 1, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 

determining said set of closest categories based on said determination. 

42. (New) The method of claim 41 , wherein said comparing comprises computing said distance 
for each category as a dot product of a vector of the document keys and a vector of the category 
keys of each category. 

43. (New) The system of claim 10, wherein said determining comprises: 

comparing the category keys of each category with said document keys to make a 
determination of a distance between the document and each category as a measure of how close 
the document is to each category; and 
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determining said set of closest categories based on said determination. 

44. (New) The system of claim 43, wherein said comparing comprises computing said distance 
for each category as a dot product of a vector of the document keys and a vector of the category 
keys of each category. 

45. (New) The method of claim 28, wherein said comparing comprises computing said distance 
for each category as a dot product of a vector of die document keys and a vector of the category 
keys of each category. 

46. (New) The system of claim 39, wherein said comparing comprises computing said distance 
for each category as a clot product of a vector of the document keys and a vector of the category 
keys of each category. 
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